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Sir: 

In connection with a Sequence Listing submitted concurrently herewith for insertion 
into the captioned specification, the undersigned hereby declares that the material being 
inserted by amendment is the material previously incorporated by reference and that the 
amendment contains no new matter. 



The sequence listing submitted herewith includes the sequences from GenBank 
accession ntmibers disclosed in the captioned application and priority documents EP 
02008199.8 and EP 02015499.3. Specifically, the sequence Hsting includes the Smac 
polypeptide of GenBank accession no. AAF87716 (SEQ ID NO:l), the polynucleotide 
encoding the Smac polypeptide as disclosed in GenBank accession no. AF262240 (SEQ ID 
NO:2), the HIV-l tat polypeptide of GenBank accession no. CAA45921 (SEQ ID NO:3) and 
the polynucleotide encoding the HIV-1 Genome of GeneBank accession no. M15654 (SEQ 
ID NO:4). 
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The GenBank accession numbers referred to above may be found on page 4, line 15; 
page 6, lines 9-10; and page 16, line 1 of the International Publication No. WO 03/086470, 
which corresponds to the present application. Additionally, the GenBank accession numbers 
AAF87716, AF262240; and CAA45921 may be found in priority document EP 02015499.3 
on page 2, line 28; page 6, lines 4-5 and page 12, lines 15-16. Finally, the GenBank accession 
nimibers AAF87716, AF262240; and Ml 5654 may be found in priority document EP 
02008199.8 on page 2, line 28; page 6, line 5 and page 15, line 4. 

Submitted herewith as Exhibits 1-4 are copies of each of the GenBank entries and 
their respective sequence revision histories. Genbank accession numbers AAF87716 and 
AF262240 have never been revised since their original GenBank submission. See Exhibits 1- 
2. GenBank accession numbers CAA45921 and Ml 5654 have been revised, however the 
version numbers of the sequences have remained the same as indicated on the sequence 
revision history of each submission. See Exhibits 3 and 4. The revisions indicated in the 
histories for GenBank accession nimibers CAA45921 and Ml 5654 do not relate to sequence 
revisions. A revision of a sequence itself would have changed the version number of the 
sequence, according to the National Center for Biotechnology Information (NCBI), As such, 
the undersigned believes that the amendment to add the sequence listing filed herewith does 
not introduce new matter. 

» 

I further declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent resulting therefi-om. 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 
FEATURES 

source 



239 aa 



linear PRI 26-JUL-2000 



Protein 



transit 



Region 



CDS 



AAF87716 
Smac [Homo sapiens] . 
AAF87716 

AAF87716 .1 GI : 94 54 219 

locus AF262240 accession AF262240 . 1 

Homo sapiens (human) 

^P'? P _sap i ens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini ; Hominidae ; Homo . 

1 (residues 1 to 239) 

Du,C., Fang,M., Li,Y., Li,L. and Wang,X. 

Smac, a mitochondrial protein that promotes cytochrome c -dependent 
caspase activation by eliminating lAP inhibition 
Cell 102 (1), 33-42 (2000) 
10929711 

2 (residues 1 to 23 9) 

Du,C., Fang,M., Li,Y. and Wang,X. 
Direct Submission 

Submitted ( Ol-MAY-2000 ) Howard Hughes Medical Institute and 
Department of Biochemstry, University of Texas Southwestern Medical 
Center, 5323 Harzry Hines Blvd., Dallas, TX 75235, USA 
Method: conceptual translation supplied by author. 

Location/ Qualifiers 
1. .239 

/organism^ "Homo sapiens" 
/db_xref=:"taxon: 9606" 
1. .23 9 

/products " Smac " 

/function^ "binds lAPs and neutralizes their inhibition on 
caspase activation and activity" 

/name= "antagonist of lAPs (inhibitors of apoptosis) " 
peptide 1 . . 55 

/note= "mitochondrial targeting sequence" 
6 . .239 

/ r e g i on_name = " Sma c_D I ABLO " 

/note= "Second Mitochondria -derived Activator of Caspases; 
pfam09057" 

/db_xref = "CDD : 72475 " 
1. .239 

/coded_by="AF2 6224 0 . 1:20, .739" 
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1 maalkswlsr svtsffryrq clcvpwanf kkrcfselir pwhktvtigf gvtlcavpia 
61 qksephslss ealmrravsl vtdststfls qttyalieai teytkavytl tslyrqytsl 
121 Igkmnseeed evwqviigar aemtskhqey Iklettwmta vglsemaaea ayqtgadqas 
181 itarnhiqlv klqveevhql srkaetklae aqieelrqkt qeegeeraes eqeaylred 
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Jul 26 2000 12:05 AM 
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Accession AAF87716 was first seen at NCBI on Jul 26 2000 12:05 AM 
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Features Sequence 



Links 



LOCUS 

DEFINITION 
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VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AF262240 1358 bp mRNA linear PRI 26-'JUL-2000 

Homo sapiens Smac mRNA, complete cds; nuclear gene for 
mitochondrial product. 
AF262240 

AF262240 .1 GI : 9454218 

Homo sapiens (human) 

Homo s ap iens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini ; 
Catarrhini ; Hominidae; Homo. 
1 (bases 1 to 1358) 

Du , C . , Fang , M . , Li , Y . , Li , L . and Wang , X . 

Smac, a mitochondrial protein that promotes cytochrome c-dependent 
caspase activation by eliminating lAP inhibition 
Cell 102 (1), 33-42 (2000) 
10929711 



FEATURES 

source 



CDS 



2 (bases 1 to 1358) 
Du , C . , Fang , M . , Li , Y . and Wang , X . 

Direct Submission 

Submitted ( Ol-MAY-2000) Howard Hughes Medical Institute and 
Department of Biochemstry, University of Texas Southwestern Medical 
Center, 5323 Harry Hines Blvd., Dallas, TX 75235, USA 
Location/Qualifiers 
1 . .1358 

/organism= "Homo sapiens" 
/mol_type= "mRNA" 
/db_xref « " taxon : 9606 " 
20. .739 

/function^ "binds lAPs and neutralizes their inhibition on 
caspase activation and activity" 

/note= "antagonist of lAPs (inhibitors of apoptosis) " 

/codon_start=l 

/products " Smac " 

/protein_id= " AAF8 7 716 . 1" 

/db_xref = "GI : 9454219" 

/translation="MAALKSWLSRSVTSFFRYRQCLCVPWANFKKRCFSELIRPWHK 
TVTIGFGVTLCAVPIAQKSEPHSLSSEALMRRAVSLVTDSTSTFLSQTTYALIEAITE 
YTKAVYTLTSLYRQYTSLLGKMNSEEEDEVWQVIIGARAEMTSKHQEYLKLETTWMTA 
VGLSEMAAEAAYQTGADQASITARNHIQLVKLQVEEVHQLSRKAETKLAEAQIEELRQ 
KTQEEGEERAESEQEAYLRED " 
t ransit peptide 2 0. .184 

/note= "mitochondrial targeting sequence" 
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1 ggcgtccgcg cgctgcacaa tggcggctct gaagagttgg ctgtcgcgca gcgtaacttc 
61 attcttcagg tacagacagt gtttgtgtgt tcctgttgtg gctaacttta agaagcggtg 
121 tttctcagaa ttgataagac catggcacaa aactgtgacg attggctttg gagtaaccct 
181 gtgtgcggtt cctattgcac agaaatcaga gcctcattcc cttagtagtg aagcattgat 
241 gaggagagca gtgtctttgg taacagatag cacctctacc tttctctctc agaccacata 
301 tgcgttgatt gaagctatta ctgaatatac taaggctgtt tataccttaa cttctcttta 
361 ccgacaatat acaagtttac ttgggaaaat gaattcagag gaggaagatg aagtgtggca 
421 ggtgatcata ggagccagag ctgagatgac ttcaaaacac caagagtact tgaagctgga 
481 aaccacttgg atgactgcag ttggtctttc agagatggca gcagaagctg catatcaaac 
541 tggcgcagat caggcctcta taaccgccag gaatcacatt cagctggtga aactgcaggt 
601 ggaagaggtg caccagctct cccggaaagc agaaaccaag ctggcagaag cacagataga 
661 agagctccgt cagaaaacac aggaggaagg ggaggagcgg gctgagtcgg agcaggaggc 
721 ctacctgcgt gaggattgag ggcctgagca cactgccctg tctccccact cagtggggaa 
781 agcaggggca gatgccaccc tgcccagggt tggcatgact gtctgtgcac cgagaagagg 
841 cggcaggtcc tgccctggcc aatcaggcga gacgcctttg tgagctgtga gtgcctcctg 
901 tggtctcagg cttgcgctgg acctggttct tagcccttgg gcactgcacc ctgtttaaca 
961 tttcacccca ctctgtacag ctgctcttac ccattttttt tacctcacac ccaaagcatt 
1021 ttgcctacct gggtcagaga gaggagtcct ttttgtcatg cccttaagtt cagcaactgt 
1081 ttaacctgtt ttcagtctta tttacgtcgt caaaaatgat ttagtacttg ttccctctgt 
1141 tgggatgcca gttgtggcag ggggagggga acctgtccag tttgtacgat ttctttgtat 
1201 gtatttctga tgtgttctct gatctgcccc cactgtcctg tgaggacagc tgaggccaag 
1261 gagtgaaaaa cctattacta ctaagagaag gggtgcagag tgtttacctg gtgctctcaa 
13 21 caggacttaa catcaacagg acttaacaca gaaaaaaa 
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VERSION 

DBSOURCE 

KEYWORDS 
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REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CAA45921 



86 aa 



linear 



VRL 2 0 -NOV- 1992 



Protein 



Region 



CDS 



HIV-1 tat [Human immunodeficiency virus 1] . 

CAA4 5 921 

CAA45921.1 GI:60145 
embl accession X64650 . 1 

Human immunodeficiency virus 1 (HIV-1) 
Huma n i mmunqde f i c i en cy virus 1 

Viruses; Retro- transcribing viruses; Retroviridae ; 
Orthoretrovirinae ; Lentivirus; Primate lentivirus group. 

1 (residues 1 to 86) 

Siderovski,D.P. , Matsuyama, T. , Frigerio,E., Chui,S., Min,X., 
Erfle,H., Sumner- Smith, M . , Barnett,R.W. and Mak,T.W. 

Random mutagenesis of the human immunodeficiency virus type-1 
trans-activator of transcription (HIV-1 Tat) 
Nucleic Acids Res. 20 (20), 5311-5320 (1992) 
14 3 7550 

2 (residues 1 to 86) 
Siderovski , D . P . 
Direct Submission 

Submitted ( 2 6 - FEB- 1992 ) D.P. Siderovski, Dept of Medical 
Biophysics, University of Toronto, 500 Sherbourne Street, Toronto 
Ontario M4X 1K9, CANADA 

Location/Qualifiers 

1 . . 86 

/organism^ "Human immunodeficiency virus 1" 
/db_xref = " taxon : 11676 " 
1..86 

/products "HIV-1 tat" 
2 . . >47 

/region_name= "Tat " 

/note= "Transact ivating regulatory protein (Tat) . The 
retroviral Tat protein binds to the Tar RNA. This 
activates transcriptional initiation and elongation from 
the LTR promoter. Binding is mediated by an arginine rich 

region; pfam0053 9" 
/db_xref = "CDD : 84 84 7 " 
1. .86 

/gene= "HIV-1 tat" 
/coded_by="X64650,l:39. .299" 
/db xref ="InterPro: IPRO 01831" 
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Dead 
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Dead 
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Links 



LOCUS 

DEFINITION 
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VERSION 
KEYWORDS 



SEGMENT 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMED 
COMMENT 



HIVBH102 8932 bp ss-RNA linear VRL 02-AUG-2006 

Human immunodeficiency virus type 1, isolate BHIO, genome. 
M15654 K02008 K02009 K02010 
M15654 .1 GI : 326383 

TAR region; acquired immune deficiency syndrome; env protein; gag 
protein; long terminal repeat (LTR) ; pol protein; polyprotein; 
proviral gene; reverse transcriptase; transactivator . 
2 of 2 

Human immunodeficiency virus 1 (HIV-1) 

Human immunode f i c i e ncy v i r u s i 

Viruses; Retro- transcribing viruses; Retroviridae ; 
Orthoretrovirinae; Lentivirus; Primate lentivirus group. 

1 (bases 1 to 8932) 

Wong-Staal, F. , Gallo,R,C., Chang, N.T., Ghrayeb,J., Papas, T.S., 
Lautenberger , J. A. , Pearson, M.L. , Petteway, S.R. Jr . , Ivanof f ,L. , 
Baumeister , K. , Whitehorn, E . A. , Raf alski , J. A. , Doran, E.R. , 
Josephs, S.J. , Starcich,B., LivaJc,K.J., Patarca,R., Haseltine , W. A. 
and Ratner,L. 

Complete nucleotide sequence of the AIDS virus, HTLV-III 

Nature 313 (6000), 277-284 (1985) 

2578615 

2 (bases 1 to 8932) 

van Beveren, C. P. , Coffin, J. and Hughes, S. 
Appendix B: HTLV-3 genome 

(in) Weiss, R.L., Teich,N., Varmus,H. and Coffin, J. (Eds.); 
RNA TUMOR VIRUSES, MOLECULAR BIOLOGY OF TUMOR VIRUSES, SECOND 
EDITION, 2; 

Cold Spring Harbor Laboratory, CSH, NY (1985) 

3 (sites) 

Hostomsky , Z . , Hostomska , 2 . , Hudson, G.O., Moomaw,E.W. and Nodes >B.R. 
Reconstitution in vitro of RNase H activity by using purified 
N- terminal and C- terminal domains of human immunodeficiency virus 
type 1 reverse transcriptase 

Proc. Natl. Acad. Sci. U.S.A. 88 (4), 1148-1152 (1991) 

1 705027 

Original source text: Human immunodeficiency virus type 1 (HIV-1) 
proviral DNA clone BHIO. 

[(in) Weiss, R., Teich,N., Varmus,H. and Coffin, J. (Eds.);RNA Tumor 
Viruses , Molecu] review; bases 1 to 8 932. 

The BHIO sequence differs from BH8 and BH5 by 0.9% in the coding 
regions and 1.8% in the noncoding regions, and the authors of [1] 
believe that these are stable variants. 

The HTLV-III genome encodes at lea£ t seven proteins: gag, pol, env, 
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tat, trs, 27K antigen and the sor 23K product. The 3' ORF 
(positions 8153-8773) is truncated in BHIO (stop codon at positions 
8522-8524), but reads through in BH8 and other sequences to yield 
what is now called the 27K antigen. 

The mechanism for pol gene translation has not been elucidated: a 
gag-pol fusion protein is possible; splicing or frameshift have not 
been ruled out . The viral protease would be determined by the 
region in question. 

The Tat protein (trans-activator protein, approximately 14 kb) is 
an effector of an autostimulatory pathway through interaction with 
a positive control element, the trans -activating responsive 
sequence, TAR, Tat seems to be a transcriptional control molecule 
in HTLV-I, but is both that and a post - transcript ional regulatory- 
molecule in HTLV-III. Deletion mutants in the tat gene are 
incapable of prolific replication and exhibit no cytopathic effects 
in T4+ cell lines. 
In addition to the 

9.4 kb genomic mRNA, subgenomic mRNAs of 7.4, 5.5, 5.0, 4,3, 2.0 
and 1.8 kb have been detected, 
FEATURES Location/Qualifiers 
source 1 . . 8932 

/organisms: "Human immunodeficiency virus 1" 

/mo l_type= "genomic RNA" 

/db_xref ="taxon: 1 16 76 " 
prim transcript <1..>8932 

/not e= "genomic mRNA" 
pr im__t r ans c r ip t < 1 . . > 8 9 3 2 

/note="tat, rev, nef subgenomic mRNA" 
intrqn 66 . .5135 

/note="tat, rev, nef subgenomic mRNA intron 1" 
gene 112.. 1650 

/gene="gag" 
CDS 112 . . 1650 

/ gene = "gag" 

/ codon_start=l 

/products "gag polyprotein precursor" 
/protein_id== " AAA44 2 01 .1 " 
/db_xre f = « GI : 3 2 6 3 8 8 " 

/ 1 rans la t ion=: " MGARAS VLSGGELDRWEKIRLRPGGKKKYKLKH I VWASRELERF 

AVNPGLLETSEGCRQILGQLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALD 

KIEEEQNKSKKKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNAWVKVVE 

EKAFSPEVIPMFSALSEGATPQDIJ^ITMmTVGGHQAAMQMLKETINEEAAEWDRVHPV 

HAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRM 

YSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKTIL 

KALGPAATLEEMMTACQGVGG PGHKARVIJVEAMSQVTNTAT IMMQRGNFRNQRKJ^^ 

FNCGKEGHTARNCRAPRKKGCWKCGKEGHQMKDCTERQANFLGKIWPSYKGRPGNFLQ 

SRPEPTAPPFLQSRPEPTAPPEESFRSGVETTTPPQKQEPIDKELYPLTSLRSLFGND 

PSSQ" 

gene <1407..4454 

/gene= "pol " 
CDS <1407 . .4454 

/gene= "pol " 

/note="ORF for pol polyprotein containing stop codons" 

/codon_start=l 

/products "pol polyprotein" 

/protein_id= " AAA44198 . 1 " 

/db_xref ="GI : 326385" 

/ 1 r ans 1 a t i on= " FFREDLAFLQGKAREFS S EQTRANS PT I S S EQTRANS PTRRELQ 
VWGRDNNSPSEAGADRQGTVSFNFPQITLWQRPLVTIKIGGQLKEAIiLDTGADDTVLE 
EMSLPGRWKPKMIGGIGGFIKVRQYDQILIEICGHKAIGTVLVGPTPVNIIGRNLLTQ 
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IGCTIiNFPISPIETVPVKLKPGMDGPKVKQWPIiTEEKiKALVEICTEMEKEGKISKIG 

PENPYNTPVFAIKKJODSTKWRKIjVDFRELNKRTQDFWEVQLGIPHPAGLKKKKSVTVL 

DVGDAYFSVPLDEDFRKYTAFTIPSINNETPGIRYQYNVLPQGWKGSPAIFQSSMTKI 

liEPFKKQNPDIVIYQYMDDIiYVGSDLEIGQHRTKIEELRQHLLRWGIiTTPDKKHQKEP 

PFLWMGYEIiHPDKWTVQPIVLPEKDSWTVNDIQKLVGKLNWASQIYPGIK\mQL^ 

RGTKALTEVIPLTEEAEIiELAENREIIiKEPVHGVYYDPSKDIilAEIQKQGQGQWTYQI 

YQEPFKNLKTGKYARMRGAHTNDVKQLTEAVQKITTESIVIWGKTPKFKLPIQKETWE 

TWWTEYWQATWIPEWEFWTPPLVKLWYQLEKEPIVGAETFYVDGAANRETKIjGKAGY 

VTNKGRQKVVPLTNTTNQKTELQAIYLALQDSGLEVNIVTDSQYALGIIQAQPDKSES 

EliWQIIEQLIKKEKVYLAWVPAHKGIGGNEQVDKLVSAGIRKIIiFLDGIDKAQDEHE 

KYHSNWRAMASDFNLPPWAKEIVASCDKCQLKGEAMHGQVDCSPGIWQLDCTHLEGK 

VILVAVHVASGYIEAEVIPAETGQETAYFLIiKIAGRWPVKTIHTDNGSNFTSATVKAA 

OTOAG I KQE FG I PYNPQSQGVVE SMNKELKKI IGQVRDQAEHLKTAVQMAVF I HNFKR 

KGGIGGYSAGERIVDIIATDIQTKELQKQITKIQNFRVYYRDSRNPLWKGPAKLIiWKG 

EGAWI QDNS D I KWPRRKAKI I RDYGKQMAGDDCVASRQDED " 

gene 4399. .4977 

/gene= "vif " 

CDS 4399 . .4977 

/gene="vif " 
/codon_start=l 
/products "vif protein" 
/protein_id« " AAA44202 . 1 " 
/db_xref = " GI : 3 2 638 9 " 

/translation="ME^^^WQVMIWQVDRMRIRTWKSLVKHHMYVSGKARGWFYRHHY 

ESPHPRISS E VH I PLGD ARL VI TT YWGLHTG ERD WHLGQGVS I EWRKKR YS TQ VD PEL 

ADQIilHLYYFDCFSDSAIRKALLGHIVSPRCEYQAGHNKVGSIiQYLAIJUyijITPKKIK 

PPIiPS VTKLTEDRWNKPQKTKGHRGSHTMNGH " 
gene 4917. .5153 

/gene= " vpr " 
CDS 4917.. 5153 

/gene= "vpr" 

/codon_start=l 

/product«"vpr protein" 

/protein_id= " AAA4 4 2 03 ,1 " 

/db_xref ="GI : 326390" 

/translations "MEQAPEDQGPQREPHNEWTLEIiLEELKNEAVRHFPRIWLHGIiGQ 
HIYETYGDTWAGVEAIIRIIiQQLLFIHFQNWVST" 
gene 5189.. 7779 

/gene="tat" 

CDS join(5189. .5403,7734. .7779) 

/gene="tat" 
/codon_start=l 
/products" tat protein" 
/protein_id= "AJ^44199, 1" 
/db_xre f = " GI : 3 2 63 8 6 " 

/translations "MEPVDPRIiEPWKHPGSQPKTACTNCYCKKCCFHCQVCFITKALG 
ISYGRKKRRQRRRPPQGSQTHQVSLSKQPTSQSRGDPTGPKE " 
gene 5328 . . 8008 

/genes "rev" 

CDS join (5328. .5403,7734. .8008) 

/genes "rev" 
/codon_start=l 
/products "rev protein" 
/protein_ids "AAA442 0 0.1" 
/db_xref s « GI : 3 2 63 8 7 " 

/ 1 rans la t ions " MAGRSGDSDEDLLKAVRLI KFLYQSNPPPNPEGTRQARRNRRRR 
WRERQRQIHSISERIIiSTYLGRSAEPVPLQLPPLERLTLDCNEDCGTSGTQGVGSPQI 
LVESPTVLESGAKE " 
intron 5404.. 7733 

/notes "tat, rev, nef subgenomic mRNA intron 2" 
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gene 5420 , . 5665 

/gene= "vpu" 
CDS 5420 . . 5665 

/gene= "vpu" 

/codon_start-l 

/product=:"vpu protein" 

/protein id=" AAA442 04 , 1 " 

/db_xref ="GI : 3 263 91" 

/ 1 rans la t ion= " MQPIQIAI VALWAI I lAIWWS IVIIEYRKILRQRKIDRLIDR 

lilERAEDSGNESEGEISALVEMGVEMGHHAPWDVDDL" 
gene 5580 . . 8150 

/gene= "env" 
CDS 5580 . . 8150 

/gene- "env" 

/codon_start=l 

/product ss" envelope polyprotein" 
/prot e in_id= " AAA442Q5 . 1 " 
/db_xref ="GI :326392" 

/translation^ " MRVKEKYQHLWRWGVmWGTMLLGMLMICSATEKLWVTVYYGVPV 

WKEATTTLFCASDAKAYDTEVHNWATHACVPTDPNPQEVVLVNVTENFl^ 

QMHEDIISIiWDQSIiKPCVKLTPIiCVSIiKCTDIiKNDTNTNSSSGRMIMEKGEIKNCSFN 

ISTSIRGKVQKEYAFFYKLDIIPIDNDTTSYTLTSCNTSVITQACPKVSFEPIPIHYC 

APAGFAILKCNNKTFNGTGPCTNVSTVQCTHGIRPWSTQLLl^GSIxAEEEVVIRSAN 

FTDNAKTIIVQLNQSVEINCTRPNNNTRKSIRIQRGPGRAFVTIGKIGNMRQAHCNIS 

RAKWNNTLKQIDSKLREQFGNNKTIIFKQSSGGDPEIVTHSFNCGGEFFYCNSTQLFN 

STWFNSTWSTKGSNNTEGSDTITLPCRIKQIINMWQEVGKAMYAPPISGQIRCSSNIT 

GLLLTRDGGNSNNESEIFRPGGGDMRDNWRSELYKYKWKIEPLGVAPTKAKRRWQR 

EKRAVGIGALFLGFLGAAGSTMGAASMTLTVQARQLLSGIVQQQNNLLRAIEAQQHLL 

QLTVWGIKQLQARILAVERYLKDQQIiLGIWGCSGKLICTTAVPWNASWSNKSLEQIWN 

NMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLEIJ3KWASLWNWFNITNW 

KliFIMIVGGLVGIiRIVFAVLSWNRVRQGYSPLSFQTHLPIPRGPDRPEGIEEEGGER 

DRDRSIRLVNGSLALIWDDLRSLCLFSYHRLRDLLLIVTRIVELLGRRGWEALKYWWN 

LLQYWSQELKNSAVSLLNATAIAVAEGTDRVIEWQGAYRAIRHIPRRIRQGLERILL" 

gene 8152,. 8523 

/gene= "nef " 

CDS 8152 . . 8523 

/gene= "nef" 

/note=" first expressed exon; premature termination" 
/number =3 

/codon__start = l 
/product = "nef protein" 
/protein_id= " AAA442 0 6 . 1 " 
/db_xref="GI: 3 263 93" 

/translation^: "MGGKWSKSSWGWPAVRERMRRAEPAADGVGAASRDLEKHGAIT 
SSNTAANNADCAWLEAQEEEEVGFPVTPQVPLRPMTYKAAVDLSHFLKEKGGLEGLIH 
SQRRQDILDLWIYHTQGYFPD" 
LTR 8441 . . >8932 

/note="3» LTR" 
repeat r egion 8894 . . >8932 

/note="R repeat 3' copy" 
ORIGIN About 120 bp downstream from HIVBHIOI. 

1 agctctctcg acgcaggact cggcttgctg aagcgcgcac ggcaagaggc gaggggcggc 
61 gactggtgag tacgccaaaa attttgacta gcggaggcta gaaggagaga gatgggtgcg 
121 agagcgtcag tattaagcgg gggagaatta gatcgatggg aaaaaattcg gttaaggcca 
181 gggggaaaga aaaaatataa attaaaacat atagtatggg caagcaggga gctagaacga 
241 ttcgcagtta atcctggcct gttagaaaca tcagaaggct gtagacaaat actgggacag 
3 01 ctacaaccat cccttcagac aggatcagaa gaacttagat cattatataa tacagtagca 
361 accctctatt gtgtgcatca aaggatagag ataaaagaca ccaaggaagc tttagacaag 
421 atagaggaag agcaaaacaa aagtaagaaa aaagcacagc aagcagcagc tgacacagga 
481 cacagcagtc aggtcagcca aaattaccct atagtgcaga acatccaggg gcaaatggta 



http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi?db==nuccore&id 



4/1/2008 



NCBI Sequence Viewer v2.0 



Page 5 of 7 



541 catcaggcca tatcacctag aactttaaat gcatgggtaa aagtagtaga agagaaggct 
601 ttcagcccag aagtaatacc catgttttca gcattatcag aaggagccac cccacaagat 
661 ttaaacacca tgctaaacac agtgggggga catcaagcag ccatgcaaat gttaaaagag 
721 accatcaatg aggaagctgc agaatgggat agagtacatc cagtgcatgc agggcctatt 
781 gcaccaggcc agatgagaga accaagggga agtgacatag caggaactac tagtaccctt 
841 caggaacaaa taggatggat gacaaataat ccacctatcc cagtaggaga aatttataaa 
901 agatggataa tcctgggatt aaataaaata gtaagaatgt atagccctac cagcattctg 
961 gacataagac aaggaccaaa agaacctttt agagactatg tagaccggtt ctataaaact 
1021 ctaagagccg agcaagcttc acaggaggta aaaaattgga tgacagaaac cttgttggtc 
1081 caaaatgcga acccagattg taagactatt ttaaaagcat tgggaccagc ggctacacta 
1141 gaagaaatga tgacagcatg tcagggagta ggaggacccg gccataaggc aagagttttg 
1201 gctgaagcaa tgagccaagt aacaaataca gctaccataa tgatgcagag aggcaatttt 
1261 aggaaccaaa gaaagatggt taagtgtttc aattgtggca aagaagggca cacagccaga 
1321 aattgcaggg cccctaggaa aaagggctgt tggaaatgtg gaaaggaagg acaccaaatg 
13 81 aaagattgta ctgagagaca ggctaatttt ttagggaaga tctggccttc ctacaaggga 
1441 aggccaggga attttcttca gagcagacca gagccaacag ccccaccatt tcttcagagc 
1501 agaccagagc caacagcccc accagaagag agcttcaggt ctggggtaga gacaacaact 
1561 ccccctcaga agcaggagcc gatagacaag gaactgtatc ctttaacttc cctcagatca 
1621 ctctttggca acgacccctc gtcacaataa agataggggg gcaactaaag gaagctctat 
1681 tagatacagg agcagatgat acagtattag aagaaatgag tttgccagga agatggaaac 
1741 caaaaatgat agggggaatt ggaggtttta tcaaagtaag acagtatgat cagatactca 
1801 tagaaatctg tggacataaa gctataggta cagtattagt aggacctaca cctgtcaaca 
1861 taattggaag aaatctgttg actcagattg gttgcacttt aaattttccc attagcccta 
1921 ttgagactgt accagtaaaa ttaaagccag gaatggatgg cccaaaagtt aaacaatggc 
1981 cattgacaga agaaaaaata aaagcattag tagaaatttg tacagaaatg gaaaaggaag 
2 041 ggaaaatttc aaaaattggg cctgagaatc catacaatac tccagtattt gccataaaga 
2101 aaaaagacag tactaaatgg agaaaattag tagatttcag agaacttaat aagagaactc 
2161 aagacttctg ggaagttcaa ttaggaatac cacatcccgc agggttaaaa aagaaaaaat 
2221 cagtaacagt actggatgtg ggtgatgcat atttttcagt tcccttagat gaagacttca 
2281 ggaagtatac tgcatttacc atacctagta taaacaatga gacaccaggg attagatatc 

23 41 agtacaatgt gcttccacag ggatggaaag gatcaccagc aatattccaa agtagcatga 

24 01 caaaaatctt agagcctttt aaaaaacaaa atccagacat agttatctat caatacatgg 
2461 atgatttgta tgtaggatct gacttagaaa tagggcagca tagaacaaaa atagaggagc 
2 521 tgagacaaca tctgttgagg tggggactta ccacaccaga caaaaaacat cagaaagaac 
2581 ctccattcct ttggatgggt tatgaactcc atcctgataa atggacagta cagcctatag 
2 641 tgctgccaga aaaagacagc tggactgtca atgacataca gaagttagtg gggaaattga 
2 7 01 attgggcaag tcagatttac ccagggatta aagtaaggca attatgtaaa ctccttagag 
2 761 gaaccaaagc actaacagaa gtaataccac taacagaaga agcagagcta gaactggcag 
2 821 aaaacagaga gattctaaaa gaaccagtac atggagtgta ttatgaccca tcaaaagact 
2 881 taatagcaga aatacagaag caggggcaag gccaatggac atatcaaatt tatcaagagc 

2 941 catttaaaaa tctgaaaaca ggaaaatatg caagaatgag gggtgcccac actaatgatg 

3 001 taaaacaatt aacagaggca gtgcaaaaaa taaccacaga aagcatagta atatggggaa 
3 061 agactcctaa atttaaacta cccatacaaa aggaaacatg ggaaacatgg tggacagagt 
3121 attggcaagc cacctggatt cctgagtggg agtttgttaa tacccctcct ttagtgaaat 
3181 tatggtacca gttagagaaa gaacccatag taggagcaga aaccttctat gtagatgggg 
3 241 cagctaacag ggagactaaa ttaggaaaag caggatatgt tactaacaaa ggaagacaaa 

33 01 aggttgtccc cctaactaac acaacaaatc agaaaactga gttacaagca atttatctag 
3 3 61 ctttgcagga ttcaggatta gaagtaaaca tagtaacaga ctcacaatat gcattaggaa 

34 21 tcattcaagc acaaccagat aaaagtgaat cagagttagt caatcaaata atagagcagt 
34 81 taataaaaaa ggaaaaggtc tatctggcat gggtaccagc acacaaagga attggaggaa 
3 541 atgaacaagt agataaatta gtcagtgctg gaatcaggaa aatactattt ttagatggaa 
3 601 tagataaggc ccaagatgaa catgagaaat atcacagtaa ttggagagca atggctagtg 
3 661 attttaacct gccacctgta gtagcaaaag aaatagtagc cagctgtgat aaatgtcagc 
3 721 taaaaggaga agccatgcat ggacaagtag actgtagtcc aggaatatgg caactagatt 
3 781 gtacacattt agaaggaaaa gttatcctgg tagcagttca tgtagccagt ggatatatag 
3 841 aagcagaagt tattccagca gaaacagggc aggaaacagc atattttctt ttaaaattag 
3 901 caggaagatg gccagtaaaa acaatacata cagacaatgg cagcaatttc accagtgcta 

3 961 cggttaaggc cgcctgttgg tgggcgggaa tcaagcagga atttggaatt ccctacaatc 

4 021 cccaaagtca aggagtagta gaatctatga ataaagaatt aaagaaaatt ataggacagg 
4 081 taagagatca ggctgaacat cttaagacag cagtacaaat ggcagtattc atccacaatt 
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4141 ttaaaagaaa aggggggatt ggggggtaca gtgcagggga aagaatagta gacataatag 
4201 caacagacat acaaactaaa gaattacaaa aacaaattac aaaaattcaa aattttcggg 
4261 tttattacag ggacagcaga aatccacttt ggaaaggacc agcaaagctc ctctggaaag 
4 321 gtgaaggggc agtagtaata caagataata gtgacataaa agtagtgcca agaagaaaag 
4 3 81 caaagatcat tagggattat ggaaaacaga tggcaggtga tgattgtgtg gcaagtagac 
4 441 aggatgagga ttagaacatg gaaaagttta gtaaaacacc atatgtatgt ttcagggaaa 
4 501 gctaggggat ggttttatag acatcactat gaaagccctc atccaagaat aagttcagaa 
4 561 gtacacatcc cactagggga tgctagattg gtaataacaa catattgggg tctgcataca 
4621 ggagaaagag actggcattt gggtcaggga gtctccatag aatggaggaa aaagagatat 
4 681 agcacacaag tagaccctga actagcagac caactaattc atctgtatta ctttgactgt 
4 741 ttttcagact ctgctataag aaaggcctta ttaggacaca tagttagccc taggtgtgaa 
48 01 tatcaagcag gacataacaa ggtaggatct ctacaatact tggcactagc agcattaata 
4 861 acaccaaaaa agataaagcc acctttgcct agtgttacga aactgacaga ggatagatgg 
4 921 aacaagcccc agaagaccaa gggccacaga gggagccaca caatgaatgg acactagagc 

4 981 ttttagagga gcttaagaat gaagctgtta gacattttcc taggatttgg ctccatggct 

5 041 tagggcaaca tatctatgaa acttatgggg atacttgggc aggagtggaa gccataataa 
5101 gaattctgca acaactgctg tttatccatt ttcagaattg ggtgtcgaca tagcagaata 
5161 ggcgttactc gacagaggag agcaagaaat ggagccagta gatcctagac tagagccctg 
5221 gaagcatcca ggaagtcagc ctaaaactgc ttgtaccaat tgctattgta aaaagtgttg 
5281 ctttcattgc caagtttgtt tcataacaaa agccttaggc atctcctatg gcaggaagaa 
5341 gcggagacag cgacgaagac ctcctcaagg cagtcagact catcaagttt ctctatcaaa 
54 01 gcagtaagta gtacatgtaa tgcaacctat acaaatagca atagtagcat tagtagtagc 
5461 aataataata gcaatagttg tgtggtccat agtaatcata gaatatagga aaatattaag 
5521 acaaagaaaa atagacaggt taattgatag actaatagaa agagcagaag acagtggcaa 
5581 tgagagtgaa ggagaaatat cagcacttgt ggagatgggg gtggagatgg ggcaccatgc 
5641 tccttgggat gttgatgatc tgtagtgcta cagaaaaatt gtgggtcaca gtctattatg 
57 01 gggtacctgt gtggaaggaa gcaaccacca ctctattttg tgcatcagat gctaaagcat 
5761 atgatacaga ggtacataat gtttgggcca cacatgcctg tgtacccaca gaccccaacc 
5821 cacaagaagt agtattggta aatgtgacag aaaattttaa catgtggaaa aatgacatgg 
5881 tagaacagat gcatgaggat ataatcagtt tatgggatca aagcctaaag ccatgtgtaa 
5941 aattaacccc actctgtgtt agtttaaagt gcactgattt gaagaatgat actaatacca 
6001 atagtagtag cgggagaatg ataatggaga aaggagagat aaaaaactgc tctttcaata 

6 061 tcagcacaag cataagaggt aaggtgcaga aagaatatgc atttttttat aaacttgata 
6121 taataccaat agataatgat actaccagct atacgttgac aagttgtaac acctcagtca 
6181 ttacacaggc ctgtccaaag gtatcctttg agccaattcc catacattat tgtgccccgg 
6241 ctggttttgc gattctaaaa tgtaataata agacgttcaa tggaacagga ccatgtacaa 

63 01 atgtcagcac agtacaatgt acacatggaa ttaggccagt agtatcaact caactgctgt 
6361 taaatggcag tctggcagaa gaagaggtag taattagatc tgccaatttc acagacaatg 
6421 ctaaaaccat aatagtacag ctgaaccaat ctgtagaaat taattgtaca agacccaaca 

64 81 acaatacaag aaaaagtatc cgtatccaga gaggaccagg gagagcattt gttacaatag 
6541 gaaaaatagg aaatatgaga caagcacatt gtaacattag tagagcaaaa tggaataaca 
6601 ctttaaaaca gatagatagc aaattaagag aacaatttgg aaataataaa acaataatct 
6661 ttaagcagtc ctcaggaggg gacccagaaa ttgtaacgca cagttttaat tgtggagggg 
6721 aatttttcta ctgtaattca acacaactgt ttaatagtac ttggtttaat agtacttgga 
6781 gtactaaagg gtcaaataac actgaaggaa gtgacacaat caccctccca tgcagaataa 
6841 aacaaattat aaacatgtgg caggaagtag gaaaagcaat gtatgcccct cccatcagtg 
6901 gacaaattag atgttcatca aatattacag ggctgctatt aacaagagat ggtggtaata 

6 961 gcaacaatga gtccgagatc ttcagacctg gaggaggaga tatgagggac aattggagaa 

7 021 gtgaattata taaatataaa gtagtaaaaa ttgaaccatt aggagtagca cccaccaagg 
7 081 caaagagaag agtggtgcag agagaaaaaa gagcagtggg aataggagct ttgttccttg 
7141 ggttcttggg agcagcagga agcactatgg gcgcagcgtc aatgacgctg acggtacagg 
72 01 ccagacaatt attgtctggt atagtgcagc agcagaacaa tttgctgagg gctattgagg 
72 61 cgcaacagca tctgttgcaa ctcacagtct ggggcatcaa gcagctccag gcaagaatcc 
7321 tggctgtgga aagataccta aaggatcaac agctcctggg gatttggggt tgctctggaa 
7381 aactcatttg caccactgct gtgccttgga atgctagttg gagtaataaa tctctggaac 
7 441 agatttggaa taacatgacc tggatggagt gggacagaga aattaacaat tacacaagct 
7501 taatacactc cttaattgaa gaatcgcaaa accagcaaga aaagaatgaa caagaattat 
7 561 tggaattaga taaatgggca agtttgtgga attggtttaa cataacaaat tggctgtggt 
7621 atataaaatt attcataatg atagtaggag gcttggtagg tttaagaata gtttttgctg 
7681 tactttctgt agtgaataga gttaggcagg gatattcacc attatcgttt cagacccacc 
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7741 tcccaatccc gaggggaccc gacaggcccg aaggaataga agaagaaggt ggagagagag 
7801 acagagacag atccattcga ttagtgaacg gatccttagc acttatctgg gacgatctgc 
7861 ggagcctgtg cctcttcagc taccaccgct tgagagactt actcttgatt gtaacgagga 

7 921 ttgtggaact tctgggacgc agggggtggg aagccctcaa atattggtgg aatctcctac 
7981 agtattggag tcaggagcta aagaatagtg ctgttagctt gctcaatgcc acagctatag 

8 041 cagtagctga ggggacagat agggttatag aagtagtaca aggagcttat agagctattc 
8101 gccacatacc tagaagaata agacagggct tggaaaggat tttgctataa gatgggtggc 
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8461 caacgaagac aagatatcct tgatctgtgg atctaccaca cacaaggcta cttccctgat 
8521 tagcagaact acacaccagg gccagggatc agatatccac tgacctttgg atggtgctac 
8581 aagctagtac cagttgagcc agagaagtta gaagaagcca acaaaggaga gaacaccagc 
8 641 ttgttacacc ctgtgagcct gcatggaatg gatgacccgg agagagaagt gttagagtgg 
87 01 aggtttgaca gccgcctagc atttcatcac atggcccgag agctgcatcc ggagtacttc 
8761 aagaactgct gacatcgagc ttgctacaag ggactttccg ctggggactt tccagggagg 
8 821 cgtggcctgg gcgggactgg ggagtggcga gccctcagat cctgcatata agcagctgct 
8 881 ttttgcctgt actgggtctc tctggttaga ccagatctga gcctgggagc tc 
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